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29751 bp RNA 
TOR2, complete genome. 



LOCUS AST274119 29751 bp RNA linear VRL 

24 -MAR.- 2 004 
DEFINITION SARS 
ACCESSION AY274119 
VERSION AY274119.3 GI: 30248028 

KEYWORDS 

SOURCE SARS coronavirus Tor2 

ORGANISM 3 : nML..-: : V';.'^::ic;.nn;L.:I::?j' : ; : 

Viruses; ssRNA positive -strand viruses, no DNA stage,- 

Nidovirales; 

Coronaviridae; Coronavirinae ; Betacoronavirus . 
REFERENCE 1 (bases 1 to 29751; 

AUTHORS He , R . , Dobie,F., Ballantine.M. , Leeson.A., Li , Y . , 
Bastien, N . , 

Cutts,T., Andonov, A. , Cao,J., Booth, T.F., Plummer, F.A. , 

Tyler , S. , 

Baker, L. and Li,X. 
CONSRTM BCCA Genome Sciences Centre, British Columbia Centre 
for Disease 

Control and National Microbiology Laboratory Canada 
TITLE Analysis of multimerization of the SARS coronavirus 
nucleocapsid 

protein 

JOURNAL Biochem. Biophys . Res. Commun . 316 (2), 476-483 (2004) 
PUBMED l&OHMi?.. 
REFERENCE 2 (bases 1 to 29751! 

CONSRTM BCCA Genome Sciences Centre, British Columbia Centre 
for Disease 

Control and National Microbiology Laboratory Canada 
TITLE Direct Submission 

JOURNAL Submitted (13 -APR- 2 003) Genome Sciences Centre, British 
Columbia 

Cancer Research Centre, 600 West 10th Avenue, 
Vancouver, BC V5Z 

4E6, Canada 
REFERENCE 3 (bases 1 to 29751) 

CONSRTM 8CCA Genome Sciences Centre, British Columbia Centre 
for Disease 

Control and National Microbiology Laboratory Canada 
TITLE Direct Submission 

JOURNAL Submitted (23 -APR-2003 ) Genome Sciences Centre, British 
Columbia 



"Cancer Research Centre, 600 West 10th Avenue, 
Vancouver, BC V5Z 

4E6, Canada 
REMARK Sequence update by submitter 
REFERENCE 4 (bases 1 to 29751) 

CONSRTM BCCA Genome Sciences Centre, British Columbia Centre 
for Disease 

Control and National Microbiology Laboratory Canada 
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TITLE Direct Submission 

JOURNAL Submitted (30-APR-2003) Genome Sciences Centre, British 
Columbia 

Cancer Research Centre, 600 West 10th Avenue, 
Vancouver, BC V5Z 

4E6, Canada 
REMARK Sequence update by submitter 

this sequence version replaced 

FEATURES Loca t i on / Qua 1 i f i a r s 
source 1, .29751 

/organism- "SARS coronavirus Tor2" 
/mol_type= " genpral c RNA " 

/isolation _soiirce= "patient #2 with severe acute 
respiratory syndrome (SARS) " 

t=i on ' 
/country^ " Canada : Toronto" 
CDS join (265. .13392, 13392, .21485) 

/ribosomal_slippage 
/codon_start=l 
/product="replicase 1AB" 

1 - 0 
/db_xre£="GI : 30795144" 

/ translation- "MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGD 

SVEEALSEARE 



PubMed (weighted) 

RefSeq nucleotide for species 

Taxonomy 



LVAEMDGIQYGRSGI 

TLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPiEDYEQN 

WNTKHGSGALRELTRELNGGAVTRYVTjNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQ 

LDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFP 

LNSKVKVIQPRVEKKKTEGFMGRIRSVYPV/ ! JHLSTLMKCNHCDEVSWQTC 

DFLKATCEHCGTENLVIEGPTTCGYLPTNAWKMPCPACQDPEIGPEHSVADYHNHSN 

IETRJjRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGJiTOITGDNVETLI'JEDL 

LEILSRERVNINI VGDFHLNEEVAI ILASFSASTSAFIDTIKSLDYKSFKTIVESCGN 

YKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFAKTLD/iANfiSIPDLQRAA 

VT I LDG I S EQS LRLVDAMV YTS DLLTNS V 1 1 MAYVT8G^¥i3lQ^lSQ!^SNLLGTTVEKL 

RP I FEW I EAKLSAGVEFLKDAWE ILKFLITGV F'D I V XGQ I QVASDNI KDCVKCFIDW 

NKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYR^ 1 , 0 3 " LKAPKEVT 

FLEGDSHDTVLTSEEVVLKKGELEALETPVTJSFTNGAIVGTPVCVNGLMLLEIKDKEQ 

YCALSPGLLATKKVFRLKGGAPIKGVTFGEDTVWgVQGYKm^ITFELDERVDKVLNE 

KCSVYTVESGTEVTEFACWAEAWKTLQPVSDLLTNMGIDLCEWSVATFYLFDDAGE 

ENFSSRMYCSFYPPDEEEEDDAEC ! E3 / OGLPLEFGASAETVE 

i ilEEEEEDWL J A K 1 LI SAQSA 

NPNP/IVNAA1n'IHLKKGGGVAGALNKATNGAN5QKESDDY1KLNGPLTVGGSCLLSGHNL 

AKKCI;HWGPNLNAGEDIQ:..I j KAAYENFNSQDTIjIjAP1;LSAGIFGAKPLQSLQVCVQT 

VRTQVYIAVNDKALYEQWMDYLDNLKPRVe" . " S7VQKPVDV 

KPKIKACIDF "T , 3 " 1 ' 

VGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTXPOQGCAGyTLESJVKT 

ALKKCKSAFYVI.PSEAPKAKEEILGTVSWNLREHLAKAEETRKLMPICMDVRAIMATI 
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QRKYKGTKIQEGI VDYGVRFFFYTSKEPVAS T ITKTjNSIjNEPLVTMPTGYVXHGP'NLE 
R 1 PA DAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSG 

RTEL I t H LDGEVL S LDKLKS LLS LREVKTI KVFTTVD 

YLDGAJ P EGKTF'F L O^-^SFAFE 

YYH'XiiL'i: -i ' / _ ! £ ALQ2LEVKFN 

APAi^EMYP^RAGDAAKFCALIIJiYSNKTVGELGDVRETWTHLLQHA^ESAKRVLN 
VVCKKCGQKTFI'L'rG VEAVMYMGTLS YDKLKTGVS I PCVCGRDATQYIiVQQESSFVMM 
INEY GNYQCGHYTHITAKETLYRIDGAHLTKM8EYKGPVTD 

FYKE i C GV TYTE I E PKLDGY YKKDNAY YTEQPIDLV PTQPLPNA 

SFDHFKLTCSOTKFAPC^QMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKK 
GAKIJ-iHKPX W'K.TNOATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVIiAVEDTQGMDNL 
- 1 - EVIECDVKTTEWGNVILKPSDEGVKVTQELGHEDLMA 
AYVBNTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSN 
- i 1 MPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDA 

GINYVOPKFSKLFTIAMWLLLI.SICLGSLICVTAAFGVLLSNFGAPSYCNGVREIA'L 
' IMDI 3GSPPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVL 

AYMtiFTKFFYIiLGIjSAlyiQVFFGYFAStiFISNSWLMWFIISIVQMAPVSAMVRMYIFF 
AS FYYIWKS YVH SMDGCTSSTCMMCYK'RNRATRVECTTI VNGMKRS F YVYANGGRGFC 
KTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKKGALHL 
YFDKAGQKTYERHPLSHFVNLDNLRANNTKGSIwPINVIVFDGKSKCDESASKSASVYY 
SQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSE 
LAKGVALDGVLSTFVSAARQGWDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTY 
NKVENMTPRDLGAC I DCNARH INAQVAKSHNVSL IWNVKD YMSI.SEQLRKQ I RSAAKK 
NNIPFRLTCATTRQV^NVITTKISLKGGKIVSTCFKI^LKATLLCsTI^VALVCYIVMPV 
HTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGG5YKNDKSC 
PVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVOJICYTPSKLIEYSDFA 
TSACVJjAABCTI FKDAMGKPV PYCYDTNLiLEGS I S YSELRPDTRYVI*MDGS i IQFPNT 
YLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRAI<S0VFCGVDAMN 
LIANIFTPLVQPVGALDVSASWAGGIIAILVTCAA * FF NHWAANAL 
LFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDX I I 

ATYVFCISLKHCKWFFNNYLRKRVNIFKGVTFSTFEE&ALCTFLLNKEMYLKLRSETLL, 
PLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKAIi? V. ' /QPFQTSI 

MYEDLL rRKS^HSP^jVQAG^yQijKVIGHSMQWCLLRLXVDTSI^FKTPKYKFVR IQPGQ 
TFSVLACYNGSPSGVYQCANIRPffifTIKGSFLNGSCGSVGFNIOYDCVSFCYMHHMELP 
TDL.EC4 F 2TRC n " LAW J 1 J f TTI 
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iYEPLTQDHVDILGFLSAQTGIAVLDMCAALKELLQNGMNGRTILGS 
TILEDEFTPFDWRQ 3 E i W J 'LLTFLTSLL I LVQSTQWSLFF F 

WENAFLPFTLGIMAIAACAMLLVKHKKAFLCLFLLPSLATVAYF^jm r YMPASWVMRI 
MTWLELADTSIjSGYRLFCDCVMyASALVLLILMTARTVYDDAARRWTLMNVITLVYKV 
IALVISVTSRYSGWTTIMFLARAIVFVCVEYYPLLFITGNTLQCI 
MLVYCFLGYCCCCYFGLFCLLSRYFRLTLGVYDYLySTQEFRYMNSQGLLPPKSSIBA 
FK^IKLLGIGGKPCIKVATVQSKMSDVKCTSWLLSVLQQLRVESSSKIjWAQCVQL.ti 
NDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPS 
YAAYATAQEAYEQAVANGDSEWLKKLKKSLNVAKSEFDRCAAMQRKLEKMADQAMTQ 
MYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLBIIPLTTAAK 
LMVW PD YGT YKNTCDGNTFT YAS ALW E I QQ VVDADS K I VQLSE INMDNS PNLAW PL I 
VTAU^SAVKLQNNELSPVALRQMSCAAGTTQTACTDDNAIAYY^SKGGRFVIAL,L 
SDHQDLKWARFPKSDGTGTIYTELEPPCRFV'I-DTPKGPKVKYLYFIKGiA^LJSiRGMVL 
GSLAATVRXjQAGNATEVPAijSTVLSFCAFAVDPAKAYKDYLASGGQPIl^CyKMLClri 
TGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAND 
PVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFFKRVCGVSAARLTPCGTG 
rSTOVVYRAFDIYNEKVAGFAKFLKTNCCRFQEKDEEGNLLDSYFVVKRHTMSNYQHE 
ETIYNLVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDT 
LKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTVQFCDAMRDA 
GIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSIJjMPILTTjTRALiAAESHM 
DADLAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCIIiKCA^3FN\' 
LFSTVFPPTSFGPLVRKIFVDGVPFWSTGYKFRELGWHNQDVNLHSSRLSFKELLV 
YAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKE 
GSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIROLLFWEWDKYFDCYDGGCIN 
ANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYA 
ISAKNR/lRTVAGVSICSTMTNRQFHQKLLKSIAATRGAWVIGTSKFYGGWHMvILKTV 
YSDVETPH 1 t ! PNMLRIMASLVLARKHmX-CNLSHRFYRLANECAQVL 

£ ;■ MVMCGGSIjYVKPGGTSSGDATTAYANSVFNICQAVTAKVNKLLSTDGNKIADKYVR 
i^QHRLYECLYRNRDVDHE^DEFYAYLRKHFSMMILSDDAWCYNSNYAAQGLVASI 
KNFKAVLYYQJMVf'MSEAKCWTSTDLTKGPHEFCSQHTMLVKQGDDYVYLPYPPiPSRI 
LGAGCF^/DDIVKTDGTLMIERFVSIAIDAYPLTKKPNQE^ADVFHLYLQYIRKLKDEL 
TGHMLD^YSVMLTKI)NTSRY^EPEFYEAFiYTPKTVLQAVGACVLGNSQTSLRGGAGlR 
RPFLCCKCCYDHVISTSHKLVLSrapWCNAPGCDV'TDVTQLYLGmSYyCKSHKPPI 
SFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCD^^ 

LKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPIJ^YVFTGYKVTKNSKVQI 
GEYTFEKGDYGDAVVYRGTTTyKTJvVGBYFVLTSHTVMPLSAPTLVPQKHYVRITGLY 
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PTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSH 

AAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQyVFCTVNALPETTRD 

IWFQEISMAraYBLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCR 

LMKTIGPDMFLGTCRRCPAEIVOTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVS 

SAIKmPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDY 

VIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDRDLYDKLQFTSLEIPRRNVAT5U3 

AENVTGLFKDCS KI I TGLH PTQAPTHLSVD I KFKTEGLCVD I PG I PKDMTYRRLISMM 

GFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVN 

LVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNWRIKIVQMLSDTL 

KGLSDRWFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVG 

FDYVYNPFM ' . iDQK :QVHGNAHVASODAIMTRCIAVHECFVKRV 

DWSVEYPIIGDELRVNSACRKVQHMWKSALLADKFPVLHDIGNPKAIKCVPQAEVEW 

KFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVL 

SNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYi'SDSPCESHGKQVVSDID 

YVPLKSATCITRCNLGGAVCRHf-lANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNT 

FTRLQSLENVAYNVVNKGHFDGHAGEAPVS 1 1 NNAVYTKVDG IDVE I FENKTTL PVNV 

AFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKP 

TESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIG 

ESVKTQFNYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKL 

EGYAFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKHYFITDAQT 

GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKWKVTIDYAEI3FMLWCKDGHVETFYP 

KLQASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLN 

TLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIG 

DCATVHTANKWDLI I SDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGS IAVKI 

TEHSWNADLYKLMG - iS S S EAFLIGANYLGKPKEQ IDGYTMHANY I 

FWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRI.IIRENNR 
VWSSDILVNN" 
CD; : ; ; 21492. .25259 

/codon_start=l 

/product^ " spike glycoprotein" 
/db_xref ="GI : 3073S145" 

/translation- " MF1 F'LLFLTLTSGSDLDRCTTF'DDVQAPNYTQH 

TSSMRGVYYPD 

EIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPF30DGIYFAATEKSNVVRGWV 
FGSTMNKKS0SVIII^;STNVVIRACNFET:CDNPFFAV5KPMGT0THTMIFDNAFNCT 
FEYISDAFS 1 P - - - 

IFKLPLGINITNFRAILTAFSPA 3 - TTFMLKYDENGTITDAV 

DCSQNPIAELKCSVKSFEIDKGIYQTSKFFA^PSGDWSFPNITl'lLCPFGEyFNATKF 
PSVYAWERKKISNOTADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGD 
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DVRQIAPGQTGVIftDYKYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRP 
FBRDISNVPFSPDGKPCTPPAI^CYWPIjroYGFYTTTGIGYQPYRVVVLSFELLNAPA 
rva DL 2C ^ x R'-QPFQQFGRDVSDFTDSVRDPK 

TSEILDISPCftFGGVSVlTPGTNASSBVAVLYQDVNCTDVSTAIKftDQLTPAWRIYST 
L RSTSQKSIVAYTMSLG 

ADSSIA.YSNNTIAI PTi^F'SiS ITTEVMPVSMAK'i'SVDCNf'Vi' iCGDSTECANLLLQYGS 

^nralsci; ; . 1 t : ; r r _ sqilpp pli p r; f s 

FIEDLL C LPPL 

AAPV3GTATAGWTFGAGAALQI PP'APIQ^iPRFPKIGVTQK^^^ 

^NAQA^M LVK QLS SNFGA I S S VLND I LS RLDKVEAE 
VQICRLITGRI J 0SLC r «VT0QLIRAAEIRASANIAATi^5SEC\T J GQSKRVX)FCGXGYH 
P P GI SWFITQ 

> ' r PI? NTVYDPLQPELDSFKEELDKYFK3JHTSPDV 
DLGD2SGINASVVNIQKEIDRLNEVAiCNLNE3LIDLQELGKYEQYIKWPWYVWijGFIA 
GLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLKYT" 

/product ="Orf 3" 
/protein_id- -MA? ■■; J. , ' » 

/db_xref=»GI: 30795146" 

/ translation= "MDLFMRFFTLRSITAQPVKIDNASPASTVHATA 

TIPLQASLPFG 

WLVIGVAFIAVFQSATKIIALNKRWQLALYKGFQFICNLLLLFVTTYSHLLLVAAGME 

AQFLYLYALIYFLQCINACRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPY 

NSVTDTIWTEGDGISTPKLKEDYQIGGYSEDRHSGVKDYVWHGYFTEVYYQLESTQ 

ITTDTGIENATFFIFNKI/7KDPPOTQIHTTDGSSGVANPAMDPIYDEPTTTTSVPL f ' 
CM 25689. ,26153 

/codon__atart=l 
/products "Orf 4" 
/protein_id= " v : .1 0 > • . . - 
/db xref ="GI : 30795147" 

/translations "MMPTTLFAGTHITMTTVYHITVSQIQLSLLKVT 

AFQl-'QNSKKTT 

KLWILRIGTQVLKTMSLYMAISPKFTTSLSLHKLLQTLVLKMLHSSSLTSLLKTHRM 

CK YTQSTALQELL IQOW I QFMMS RRRLLACLCKH KKVSTNLCTHSFRKKQVR " 
C'x : | 26117.. 26347 

/codon_start=l 

/product^ "small envelope E protein" 

tr ^_ 1 

/db_xref="GI: 30795148" 

/translations "MYSFVSEETGTLIVNSvO.LFLA.FWFLLVTLAI 

LTALRLCAYCC 

NIVNVSLVKPTVYVYSRVKNLNSSEGVPDLLV" 
/ codon_start= i 

/products "membrane glycoprotein M" 

dt l ei 

/db_xref="Gl : 30795149" 

/translation-" MADNGTITVEELKQLLEQWNLV1GFLFLAWIML 

LQFAYSNRNRF 
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LYIIKLVFLWLLWPVTIACFVLAAVYRI^VTGGIAIAMACIVGLMWLSYFVASFRLF 

ETNILiLNVPLRGTIVTRP^ E I F CHLRMAGHSLGRCBI 

KDLPKE I t A ^NDNIAL 

LVQ" 

CSS 27074 . .27265 

/codon_start=l 

/product="Orf7" 

/ prate i.n id " AA >'i ; . • >• 

/ db_xre f = " G 1 : 30795150" 

/ trans 1 at ion - "MFiiliVDFQVTIAEILI IIMRTFRIAIWNLDVI. I 



SSIVRQLFKPL 



TKKNYSELDDEEPMELDYP" 
27273 . .27641 

/codon_start=l 
/product- "Orf 8" 
/proue in_id= " AA*«J : 0<i 3 . jl 
/db_xref="GI:3079S151» 

/translation^-' MKI II.FI "LIVFTSCFI.YHYQBC/PGTTVLLKE 



LIVAM,VK!.I JXFTIKSKTfc » 
C.:.:S 27638.. 27772 

/codon_start=l 
/product="Orf 9" 
/protein_id=" AA' X<N ■■ . ; » 
/db_xref - "GI : 30795152" 

/translation- ''MNELTLIDFYLCFLAFLLFLVLIMLIIFWFSLE 

I QDLEEPCTKV " 

COS 27779.. 27898 

/codon_start=l 
/product="Orf 10" 
/protein_id« * ' ■ "- . ■ " 
/db_xref =»GI : 30795153" 

/ trans la tion= "MKLLi VLTCISLCSCIC'i"WQRCASNKPHVLED 

PCKVQH" 

CDS 27864.. 28118 

/codon_start-l 
/product="Orf 11" 
/protein_id= " ^AF1 : : 0.?. ; , . .1 " 
/db_xref="GI : 30795154 " 

/ translation " MCLKILVRYNTSGNTY STAWLCALGKVL PFHRW 

HTMVQTCTPNV 

T 1 NCQD PAGGAL I ARCW YLHEGHQTAAFRDVLVVLN KRTN " 



/translatic 

NGARPKQRRPO 

GLPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKM 

KELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGAIJSPrPKDHIGTRNPNNNAATVL 

QLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGET 

ALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAES ' QYNVTQAFG 

RRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTY 

HGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPT 
VTLLPAfV.OMDDFSRQ 1 ^ A - ,A 

CDS 28130.. 28426 

/codon_start=l 
/product- "Orf 13" 
/ p r o t e i n_ i d = " A."j i;l 1 .0 4 > : . ;. " 
/db_xref - " GI:307 95156° 

/translation« 'KDX'MQTNVVPPALHLVDPQIQLTITRMEDAMGQ 
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GQNSADPKVYP 

IILRLGSQLSLSMARRNLDSLEARAFX2STPIVVQ^^ 
CDS 28583 , .28795 

/product- "Orf 14" 
/protein_id» ■' - 

/dO_xief- GI ' . 

/translation* "MLPPCYNP , v VVKPLIAP 

HHWAVIQEIQ 

LLAAVGE I LLLEWLAEWKLPSRYCC " 

ORIGIN 

1 atattaggtt tttacctacc caggaaaagc caaccaacct cgatctcttg 
tagatctgtt 

61 ctctaaacga actttaaaat ctgtgtagct gtcgctcggc tgcatgccta 

gtgcacctac 

121 gcagtataaa caataataaa ttttactgtc gtcgacaaga aacgagtaac 
tcgtccctct 

181 tetgcagact gcttacggtt tcgtccgtgt tgcagrcgat catcagcata 
cctaggtttc 

241 gtccgggtgt gaccgaaagg taagatggag agccttgctc ttggtgtcaa 
cgagaaaaca 

301 cacgtccaac tcagtttgcc tgtccttcag gtcagagacg tgctagtgcg 
tggcttcggg 

361 gactctgtgg aagaggccct atcggaggca cgcgaacacc tcaaaaatgg 
cacttgtggt 

421 ctagtagagc tggaaaaagg cgtactgccc cagcttgaac agccctatgt 
gttcattaaa 

481 cgttctgatg ccttaag 3 j - • tgc cacaaggccg ttgagctggt 
tgcagaaatg 

541 gacggcattc agtacggtcg tagcggtata acactgggag tactcgtgcc 
acatgtgggc 

601 gaaaccccaa ttgcataccg caatgttctt cttcgtaaga acggcaacaa 
gggagccggt 

661 ggtcatagct atggcatcga tctaaagtct tatgacttag gtgacgagct 
tggcactgat 

721 cccattgaag attatgaaca aaactggaac acuaagcatg gcagtggtgc 
actccgtgaa 

781 ctcactcgtg agctcaatgg aggtgcagtc actcgctatg tcgacaacaa 
tttctgtggc 

841 ccagatgggt accctcttga ttgcatcaaa gattttctcg cacgcgcggg 
caagtcaatg 

901 tgcactcttt ccgaacaact tgattacatc gagtcgaaga gaggcgtcta 
ctgctgccgt 

961 gaccatgagc atgaaattgc ctggttcact gagcgctctg ataagagcta 
cgagcaccag 

1021 acacccttcg aaattaagag tgccaagaaa tttgacactt tcaaagggga 
atgcccaaag 

1081 tttgtgtttc ctcttaactc aaaagtcaaa gtcattcaac cacgtgttga 
aaagaaaaag 

1141 actgagggtt tcatggggcg tatacgctct gtgtaecctg ttgcatctcc 
acaggagtgt 

1201 aacaatatgc acttgtctac cttgatgaaa tgcaatcatt gcgatgaagt 
ttcatggcag 

1261 acgtgcgact ttctgaaagc cacttgtgaa cattgtggca ctgaaaattt 
agttattgaa 

1321 ggacctacta catgtgggta cctacctact aatgetgtag tgaaaatgcc 
atgtcctgcc 

138; tglcaagacc cagagattgg acctgaycac aqtgttgcag attalcacaa 

1441 attgaaactc gactccgcaa gggaggtagg actagatgtt ctggaggctg 
tgtgtttgcc 

150.; tatgttggct gctataalaa gcgtgcelac tgggttccuc gtgctagtgc 
tgatattggc 

1561 tcaggccata ctggcattac - t^aatqaorqa 
tctccttgag 

1621 atactgagtc gtgaacgtgt taacattaac autgttggcg attttcattt 
gaatgaagag 

1681 gttgccatca ttttggcatc tttctctgct cctacaagcg cctttattga 
cactataaag 

1741 agtcttgatt acaagtcttt caaaaccatt gttgagtcct gcggtaacca 
taaagttacc 
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1801 aagggaaagc ccgtaaaagg tgcttggaac attggacaac agagatcagt 

tttaacacca 

1861 ctgtgtggtt Ctccctcaca ggctgctggt gttatcagat caatttttgc 
gcgcacactt 

1921 gatgcagcaa accactcaat tcctgatttg caaagagcag ctgccaccat 
acttgatggt 

1981 atttctgaac agtcattacg tcttgtcgac gceatggttt atacttcaga 
cctgctcacc 

2041 aacagtgtca ttattatggc atatgtaact ggtggtcttg tacaacagac 
ttctcagtgg 

2101 ttgtctaatc ttttgggcac tactgttgaa aaactcaggc ctatctttga 
atggattgag 

2161 gcgaaactta gtgcaggagt tgaatttctc aaggatgctt gggsgatfcct 

caaatttctc 

2221 attacaggtg tttttgacat cgtcaagggt caaatacagg ttgcttcaga 
taacatcaag 

2231 gatt-.9t.9taa aatgcttcat tgatgttgtc aacaaggcac tcgaaatgtg 
cattgatcaa 

2341 gtcactatcg ctggcgcaaa gttgcgatca ctcaacttag gtgaagtctt 

24 01 agcaagggac tttaccgtca gtgtatacgc ggcaaggagc agctgcaact 
actcatgcct 

2461 cttaaggcac caaaagaagt aacctttctt gaaggtgatt cacatgacac 
agtacttacc 

2521 tctgaggagg ttgttctcaa gaacggtgaa ctcgaagcac t.cgagacgcc 
cgttgatagc 

2 581 ttcacaaatg gagctatcgt tggcacacca gtctgtgtaa atggcctcat 
gctcttagag 

2641 attaaggaca aagaacaata ctgcgcattg tctcctggtt tactggctac 
aaacaatgtc 

2701 tttcgcttaa aagggggtgc accaattaaa ggtgtaacct ttggagaaga 
tactgtttgg 

2761 gaagttcaag gttacaagaa tgtgagaatc acat.ttgagc ttgatgaacg 
tgttgacaaa 

2821 gtgcttaatg aaaagtgctc tgtctacact gttgaatccg gtaccgaagt 
tactgagttt 

2881 gcatgtgttg tagcagaggc tgttgtgaag actttacaac cagtttctga 
tctccttacc 

2941 aacatgggta ttgatcttga tgagtggagt gtagctacat tctacttact 
tgatgatgct 

3001 ggtgaagaaa acttttcatc acgtatgtat tgttcctttt accctccaga 
tgaggaagaa 

3061 gaggacgatg cagagtgtga ggaagaagaa attgatgaaa cctgtgaaca 
tgagtacggt 

3121 acagaggatg attatcaagg tctccctctg gaatttggtg cctcagctga 
aacagttcga 

3181 gttgaggaag aagaagagga agactggctg gatgatacta ctgagcaatc 
agagattgag 

3241 ccagaaccag aacctacacc tgaagaacca gttaatcagt ttactggtta 

3301 actgacaatg ttgccattaa atgtgttgac atcgttaagg aggcacaaag 

336i"atggtgattg taaatgctgc taacatacac ctgaaacatg gtggcggtgt 
agcaggtgca 

3421 ctcaacaagg caaccaatgg tgccatgcaa aaggagagtg atgattacat 
taagctaaat 

34 31 ggccotcr.ta cagtaggagg gtcttgtttg ctttctggac ataat.ctt.gc 
taagaagtgt 

354 1 ctgcatgttg ttggacctaa cctaaatgca ggtgaggaca tccagcttct 
taaggcagca 

3601 tatgaaaatt tcaattcaca ggacatctta cttgcaccat tgttgtcagc 
aggcatattt 

3661 ggtgctaaac cacttcagtc tttacaagtg tgcgtgcaga cggttcgtac 
acaggtttat 

3'?2l attgcagtca atgacaaagc tctttatgag caggttgtca tggattatct 
tgataacctg 

3781 aagcctagag tggaagcacc taaacaagag gagccaccaa acacagaaga 

3841 gaggagaaat ctgtcgtaca gaagcctgtc gatgtgaagc caaaaattaa 
ggcctgcatt 

9 1 1 igaaact aagtttctta ccaataagtt 

actcttgttt 
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3961 gctgatatca 


atggtaagct 


tgaagatatg 




4031 tccctccttg 


agaaggatgc 


tggtgatatc 




4081 acttgtgttg 


taataccctc 


ctcaagagct 




4141 ttgaagaaag 


tgccagttga 


atgCgctggt 




42 01 tatacacttg 


aggaagctaa 


ttaegtact:« 




4261 CCttcagaag 


cacctaatgc 


gaatttgaga 




4 321 gaaatgcttg 


ctcatgctga 


ggatgttaga 




4381 gccataatgg 


caaccatcca 


gggeategtt 




444 1 gaetatggcg 


tccgattctt 


tattattacg 




4 501 aagctgaact 


ctctaaatga 


gacaoatggt 




4561 tttaatcttg 


aagaggctgc 


cgc.agtguca 




4621 gtatcatcac 


cagatgctgt 


atcaaagaca 




4681 tctgaggagc 


actttgtaga 


ttggtc.ecat 




4 741 r. c a g g a c a g c 


gtacagagtt 


aactgtguac 




4801 cacactctgg 


agagccccgt 


acttgacaaa 




4861 ctaaagagtc 


tcttatccct 


aactgcggac 




4 921 aacaetaatc 


tccacacaca 


gcagtctggt 




4981 ccaacatact 


cggatggtgc 


tcatgagggt 




S041 aagaccttct 


ttgtactacc 


cgagtactac 




5101 catacectcg 


atgagagttt 


cacaaagaaa 




5161 Cggaaatttc 


ctcaagttgg 


caattgttat 




5221 ttgtctagtg 


ttttattagc 


accagcactt 




5281 caagaggctt 
actcatactc 


attatagagc 


5341 gcttacagta 


ataaaactgt 


gacccatctt 




54 01 ctacagcatg 


ctaatttgga 


taaacattgt 




5461 ggtcagaaaa 


ctactacctt 


tactctatct 




5521 tatgataatc 


ttaagacagg 


tgctacacaa 




5581 catctagtac 


aacaagagtc 


tgagtataaa 




5641 v. t. a c a q c a a g 


gtacattctC 


gtgtggtcat 




5701 tacactcata 


taactgctaa 


cctcacaaag 




5761 atgtcagagt 


acaaaggacc 


CtaCa 582"ac-at-aa c 
tgaaccaaaa 3 "" 3 ^ 3930 


ctq-atcgta 


5881 ttggatgggt 


attataaaaa 


agaccttgta 




5941 ccaactcaac 


cattaccaaa 


6001 aaatttgctg 


atgatttaaa 


acgagagcta 




606.1 tctgtcacat 


tcttcccaga 


tagacactat 





12, complete genome 

ttaccatgat tctcagaaca tgcttagagg 
accttacatg gtaggtgatg ttatcactag 
caaaaaggct ggtggcacta ctgagatgct 
tgagtatata accacgtacc ctggacaagg 
gactgctctt aagaaatgca aatctgcatt 
taaggaagag attctaggaa ctgtatcctg 
agagacaaga aaattaatgc ctatatgcat 
acgtaagtat aaaggaatta aaattcaaga 
cttttatact agtaaagagc ctgtagcctc 
gccgcttgtc acaatgccaa ttggttatgt 
gcgctgtatg cgctctctta aagctcccgc 
tactacatat aaCggatacc tcacttcgtc 
aacagtttct ttggctggct cttacagaga 
aggtgttgaa ttccttaagc gtggcgacaa 
cgagtctcat ctcgacggtg aggttctctc 
gcgggaggtt aagactaCaa aagtgttcac 
gct.tgtggat atgtctacga catatggaca 
CgatgCCaca aaaattaaac ctcatgtaaa 
tagtgatgac acactacgta gtgaagcttt 
tcctggtagg tacatgtctg ctttaaaeca 
tggtttaact tcaattaaat gggccgacaa 
acctcaaeag cttgaagcca aattcaacgc 
ccgtgctggt gatgctgcta acttctgcgc 
tggcgagctt ggtgatgtca gagaaacuat 
atctgcaaag cgagttctta atgtggtgtg 
aacgggtgta gaagctgcga tgtatatggg 
tgtttccatt ccatgtgtgt gtggtcgtga 
ttcttttgtt atgatgtctg caccacctgc 
atgcgcgaat gagtacactg gtaactatca 
ggagaccctc tatcgtattg acggagctca 
agtgactgat gttttctaca aggaaacatc 
taaactcgat ggagttactt acacagagat 
ggataatgct tactatacag agcagcctat 
tgcgagtttt gataatttca aactcacatg 
tcaaatgaca ggcttcacaa agccagcttc 
cttgaatggc gatgtagtgg ctattgacta 
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6181 caggctacaa ccaagacaac gttcaaacca aacacttggt gtttacgttg 
tctttggagt 

6241 acaaagccag tagatacttc aaattcattt gaagttctgg cagtagaaga 
cacacaagga 

6301 acggacaatc tcgcctgcga aagtcaacaa cccacccctg aagaagtagt 
ggaaaatcct 

6361 accatacaga aggaagtcat agagtgtgac gtgaaaacta ccgaagttgt 
aggcaatgtc 

6421 acacttaaac catcagatga aggtgttaaa gtaacacaag agtcaggtca 
tgaggatctt 

6431 atggcCgctt acgtggaaaa cacaagcatc accactaaga aacctaatga 
gctttcacta 

6S41 gcctcaggtt taaaaacaac tgccacccac ggtattgctg caattaatag 
tgttccttgg 

6601 agtaaaactt tggctlat.gr caaaccactc ttaggacaag cagcaattac 

6661 tgcgctaaga gattagcaca acgtgtgttt aacaattata tgccttatgt 
gtttacatta 

6721 ttgttccaat tgtgtacttt tactaaaagc accaatccta gaattagagc 
ttcactacct 

6781 acaactattg ctaaaaatag tgttaagagc gttgctaaat tatgtttgga 
tgccggcatt 

6841 aattatgtga agtcacccaa attttctaaa ttgttcacaa tcgctacgtg 
gctattgttg 

6901 ttaagtattt gcttaggttc tctaatctgc gtaactgctg cttttggtgt 
actcttatct 

6961 aattttggtg ctccttctta ttgtaatggc gctagagaac tgtatcttaa 
ttcgtctaac 

7021 gttactacta tggatttctg tgaaggttcc tttccttgca gcatttgttt 
aagtggatta 

7081 gactcccttg attcttatcc agctcttgaa accattcagg tgacgatttc 
atcgtacaag 

7141 ctagacttga caattttagg tctggccgct gagtgggttt tggcatatat 
gttgttcaca 

7201 aaattctttt atttattagg tctttcagct acaatgcagg tgttctttgg 
ctattttgct 

7261 agtcatttca tcagcaattc ttggctcatg tggtttatca ttagtattgt 
acaaatggca 

7321 cccgtttctg caatggttag gatgtacatc ttctttgctt ctttctacta 
catatggaag 

7381 agctatgttc atatcatgga tggttgcacc tcttcgactt gcatgatgtg 
ctataagcgc 

7441 aatcgtgcca cacgcgctga gtgtacaact attgttaatg gcatgaagag 
atctttctat 

7501 gtctatgcaa atggaggccg tggcttctgc aagactcaca attggaattg 
tctcaattgt 

7561 gacacatttt gcactggtag tacattcatc agtgatgaag ttgctcgtga 

7621 cagtttaaaa gaccaaccaa ccctactgac cagtcatcgt atattgttga 
tagtgttgct 

7681 gtgaaaaatg gcgcgcttca cctctacttt. gacaaggctg gtcaaaagac 
ctatgagaga 

7741 catccgctct cccattttgt caatttagac aatttgagag ctaacaacac 
taaaggttca 

7801 ctgcctatta atgtcatagt ttctgatggc aagtccaaat gcgacgagtc 
tgcttctaag 

7861 tccgcttctg tgtactacag tcagctgatg tgccaaccta ttctgttgct 
tgaccaagct 

7921 cttgtatcag acgttggaga tagtactgaa gtttccgtta agatgtttga 
tgcttatgtc 

7981 gacacctttt cagcaacttt tagtgttcct atggaaaaac ttaaggcact 
tgttgccaca 

8041 gctcacagcg agttagcaaa gggtgtagct ttagatggtg tcctttctac 
attcgtgtca 

8101 gctgcccgac aaggtgttgt tgataccgat gttgacacaa aggatgttat 
tgaatgtctc 

8161 aaactttcac atcactctga cttagaagtg acaggtgaca gttgtaacaa 
tttcatgctc 

8221 acctataata aggttgaaaa catgacgccc agagatcttg gcgcatgtat 

tgactgtaat 
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8281 gcaaggcata tcaatgccca 
ctggaatgta 

8341 aaagactaca tgtctttatc 
tgccaagaag 

8401 aacaacatac cttttagact 
tgtoataact 

84 61 actaaaatct cactcaaggg 
tatgcttaag 

8531. gccacattat tgtgcgttct 
agtacataca 

85-81 ttgtcaatcc atgatggtta 
tcaggatggt 

8S41 gtcactcgtg acatcatttc 
tggtttfcgae 

87G1 gcatggttta gccagcgtgg 

"878"; gctatcatta caagagagat 
tgtgetgaga 

8SS1 gcaatcaatg gtgacttctt 
tggcaaeatt 

8381 tgctacacac cttccaaact 
ttgcgfetsfet 

8S4I gctgctgagt gtacaatttt 
ttgttatgac 

9001 actaatttgc tagagggttc 
tegttatgtg 

tgctagagta 

9121 gtaacaactt ctgatgctga 
agaagcaggt 

9181 attcgcctat ctaccagtgg 
agctctaCCa 

9241 ggagttctct gtggtgttga 
tcctcutgtg 

9301 wacaq v> qLotctttaga 
tattgccata 

9361 ctggtgactt gtgctgccta 
tgagtacaac 

9421 catgttgttg ctgctaatgc 
ctgtctggta 

9481 ccagcttaca gctttctgcc 
gacattctat 

9541 ttoaccaatg atgtttcatt 
ttctcctatt 

9601 gtgccttttt ggataacagc 
ctgccattgg 

9661 ttctttaaea actatcttag 
tagtaccttc 

9721 gaggaggctg ctctgtgtac 
attgcgtagc 

9781 gagacactgt tgccacttac 
caagtacaag 

9841 tatttcagtg gagccttaga 
ccacttagca 

9901 aaggctctaa acgaccttag 
accacagaca 

9961 tcaatcactt ctgctgttct 
gtcaggcaaa 

10021 gttgaagggc gcatggcaca 
attgtggttg 

10081 gatgacacag tatactgtcc 
gcttaatcct 

10141 aactatgaag atctgcccac 
ggctggcaat 

10201 gttcaacttc gtgttattgg 
taaagttgat 

10261 actcctaacc ctaagacacc 
tcaaacattt 

10321 tcagttctag catgctacaa 
catgagacct 

10381 aatcatacca ttaaaggttc 
ttttaacatt 



12, complete 


genome 




agtagc a a a a. 






tgaacagctg 


cgtaaacaaa 




aacttgtgct 


a c aact agac 




tg9taagatt 




gt^ttlaact 


tgctgcattg 


gtttgttata 




cacaaatgaa 


atcattggtt 




tactgatgat 


tgttttgcaa 




tggt tcatac 


aaaaatgaca 


aaagctgccc 


tggtttcata 


gtgccr.ggct 


taccgggtac 


gcattttcta 


cctcgtgttt 


ttagtgctgt 


catt.gagtat 


a g t g a 1. 1 1 1 g 


ctacctctgc 


caaggatgct 


















gtactgtaga 




gcgaaaggtc 


n[l!glat 




agcat cacag 




ctcata[ctl 






fj cagtag'cgg 


c t g g 'c g g 'c a t 






g t g 1 1 1: t *c g g 


^ttttgttt 


ttgatgtctt 




gggagtctac 


tcagtctt t t 


act tgtact t 


cttggctcac 


ct,caatggt 


ttgccatgtt 


&atctat.gt;a 


tt.ctgr.attt 


ctctgaagca 


qaaaagagtc 


atgtttaatg 


gagr.tacat t 


ex. ttttgctc 


aacaaggaaa 


tgfcacctaaa 


a c a g L a * a a c 


aggtatcttg 


ctctatataa 






tc!aclac! 




gctgltgtt! 




gcag^gtggt 


tttaggaaaa 


tggcattccc 


agtaacctgt 


ggaac =caa 


ctcctaatgg 


aagacatgtc 








^ elegit 




ccattctatg 


caaa^t'.'^v.c 


v.gcc^aggct: 


caagta a&a 


tttgtccgta 


tccaaccrtgg 


tggttcacca 


tctggtgttt 


ateagtgtgc 


tttccttaat 


ggatcafcgtg 


gtagtgttgg 



http://www.ncbi.nini.nih.gov/nuccore/30248028 



Nucleotide - SARS coronayirus TOF 

10141 gafcatgatt gcgcgtctu; 
aggagfacac 

10'iOl getggtactg acttagaagg 
aactgcacag 

10561 gctgcaggta cagacacaac 
tgctgctgct 

10621 atcaar.ggt.g ataggtggtt 
cr.ttaacctt 

10681 gtggcaatga agr.acaacta 
attgggacct 

10741 ctttctgctc aaacaggaat 

10801 cagaatggta tgaatggtcg 
tgagtttaca 

10861 ccatttgatg ttgttagaca 

10321 gttaagggca ctcatcattg 
gattcttgtt 

10981 caaagtacac agtggtcact 
gccatt.tact 

11041 cttggtacta tggcaattgc 
gcacgcattc 

11101 ttgtgcctgc t-.tctgtc.acc 
<>•" •>-'--' ! - 

11161 cctgctagct gggtgatgcg 

11221 ggttataggc ttaaggattg 
tattctcatg 

11281 acagctcgca ctgtttatga 
gaatgtcatt 

11341 acacttgttt acaaagtcta 
catgtgggcc 

11401 ttagttattt ctgtaacctc 
gtttttagct 

11461 agagctatag tgtutgtgtg 
tggcaacacc 

11521 ttacagtgta tcacgcttgt 




ctacttggtc 

11641 tctacacaag aatttaggta 
gagtagtact 

11701 gatgctttca agcttaacat 
tatcaaggtt 

11761 gctactgtac agtctaaaat 
gctctcggtt 

11821 cttcaacaac ttagagtaga 
acaactccac 

11881 aatgatattc Ctcttgcaaa 
ttctcttttg 

11941 tctgttttgc catccatgea 
ggaaatgctc 

12001 gataaccgcg ctactcttca 

12061 gcttatgcca ctgcccagga 
ttctgaagtc 

12121 gttctcaaaa agttaaagaa 
ccgtgatgct 

12181 gccacgcaac gcaagttgga 
gtacaaacag 

12241 gcaagatctg aggacaagag 
gctcttcact 

12301 atgc^taqgii aqctcgaraa 
gcgtgatggt 

12361 tgtgttccac tcaacatcat 
tgttgtccct 

12421 gattatggta cctacaagaa 

12481 tgggaaatcc agcaagttgt 

12541 atggacaatt caccaaattt 
agccaactca 
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ctgctatatg catcatatgg agcttccaac 
taaattctat ggtccatttg ttgacagaca 
cataacatta aatgttttgg catggctgta 
tcttaataga ttcaccacta cttt'gaatga 
tgaacctttg acacaagatc atgttgacat 
tgccgtctta gatatgtgtg ctgctttgaa 
tactatcctt ggtagcacta ttttagaaga 
atgctctggt gttaectt.cc aaggtaagtt 
gatgctttta actttcttga catcactatt 
gtttttcttt gtttacgaga atgctttctt 
tgcatgtgct atgctgcttg ttaagcataa 
ttctcttgca acagttgct.t actttaatat 
tatcatgaca tggcttgaat tggctgacac 
tgttatgtaC gcttcagctt tagttttgct 
tgatgctgct agacgtgttt ggacactgat 
ctatggcaat gctttagatc aagctatttc 
taactattct ggtgtcgtta cgactatcat 
tgttgagtat cacccattgt tatttattac 
ttattgtttc ttaggctatt gttgctgctg 
ttacttcagg cttactcttg gtgtttatga 
tatgaactcc caggggctct tgcctcctaa 
taagttgttg ggtafcggag gtaaaccatg 
gtctgacgta aagtgcacat ctgtggtact 
gtcatcttct aaattgtggg cacaatgtgt 
agacacaact gaagctttcg agaagatggt 
gggtgctgta gacattaata ggttgtgcga 
ggctattgct tcagaattta gttctttacc 
ggcctatgag caggctgtag ctaatggtga 
atctttgaat gtggctaaat ctgagtttga 
aaagatggca gatcaggcta tgacccaaat 
ggcaaaagta actagtgcta tgcaaacaat 
tgatgcactt aacaacatta tcaacaatgc 
accattgact acagcagcca aactcatggt 
cacttgtgat ggtaacacct ttacatatgc 
tgatgcggat agcaagattg ttcaacttag 
ggcttggcct cttattgtta cagctctaag 
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12601 gctgttaaac 


tacagaataa 


tgaactgagt 


ccagtagcac 


tacgacagat 


gtcctgtgcg 

12661 gctggtacca 


cacaaacagc 


ttgtactgat 


gacaatgcac 


ttgcctacta 


fcaacaattcg 

12721 aagggaggta 


ggtttgtgct 


ggcattacta 


tcagaccacc 


aagatcccaa 


> r 3 • a 9 










12781 ttccctaaga 


gtgacggtac 


aggtacaatt 


tacacagaac 


tggaaccacc 


ttgtaggttc 










12841 gttacagaca 


caccaaaagg 


gcctaaagtg 


aaatacttgt 


acttcatcaa 


aggcctaaac 










12901 aacctaaata 


gaggtatggt 


gctgggcagt. 


ttagcEgcta 


cagtacgtct 


tcaggctgga 










129SI aacgctacag 


aagtacctgc 


caattcaact 


gtgctttcct 


tctgtgcttt. 


tgeagtagac 










13021 ectgctaaag 


catataagga 


ttacctagca 


agtggaggac 


aaccaatcac 


caactgcgtg 










13081 aaqar.gtv.gt 


gtacacacac 


tggtacagga 


caggcaat ta 


ctqtaacacc 


agaagctaac 










13141 atggaccaag 


agtectttgg 


tggtgcttca 


tgttgtctgt 


attgtagatg 


ccacatcgac 










13201 cacccaaatc 
acccaccacc 


Ctaaaggatt 


ctgtgacttg 


aaaggtaagc 


acgtccaaat 


13261 tgcgctaatg 


acccagtggg 


ttttacactt 


agaaacacag 


tctgcaccgt 


etgcggaatg 










1 3 321 tggaaaggtt 


atggctgtag 


ttgtgaccaa 


ctccgegaac 


ccctgatgca 


gtctgcggac 










13?. SI geatcaacgt 


ttttaaacgg 


gtttgcggtg 


taagtgcagc 


ccgteceaca 


r.g 3 gjgcs 










1344 1 caggcactag 


tactgatgtc 


gtctacaggg 


cttttgatat 


ttacaacgaa 












13501 gtttcgcaaa 


gttcctaaaa 


actaattgct 




ggagaaggac 


gaggaaggca 










13561 atfctattaga 




gtagttaaga 


ggcatactac 




caaeatgaag 











a taacttggtt aaagattgtc cagcggttgc tgtccatgac 
uttcteaagt 

13S31 etagagtaga tggtgacatg gtaccacata tatcacgtca gcgtctaact 
aaatacacaa 

13741 tggctgattt agtctatgct ctacgtcatt ttgatgaggg taattgcgat 
acattaaaag 

13801 aaatactcgt cacatacaat tgctgtgatg acgattattc caataagaag 

13861 acttcgtaga gaatcctgac accttacgcg tatatgctaa cttaggcgag 
cgtgtacgcc 

13921 aatcattatt aaagactgta caattctgcg aCgctatgcg tgatgcaggc 
attgtaggcg 

13981 tactgacatt agataatcag gatcttaatg ggaactggta cgatttcggt 

14041 aagtagcace aggctgcgga gctcctattg tggattcata ttactcattg 
ctgatgccca 

14101 tcctcacttt gactagggca ttggctgctg agtcccatat ggacgccgat 
ctcgcaaaac 

14161 cacttattaa gtgggatttg ctgaaatatg attttacgga agagagactt 
tgtctcttcg 

14221 accgttattt caaatattgg gaccagacat accatcccaa ttgtattaac 
tgtttggatg 

14281 ataggcgtat ccttcattgt gcaaacctca atgtgttatt ttctactgtg 
tctccaccta 

14341 caagttttgg accactagc.a agaaaaatat ttgtagatgg tgttcctttt 
gttgtttcaa 

14401 ctggatacca ttctcgtgag ttaggagtcg tacataatca ggatgtaaac 
ttacatagct 

14461 cgcgtctcag tttcaaggaa cttttagtgt atgctgctga tccagctatg 
~14521 ctggcaattt attgctagat aaacgcacta catgcttttc agtagctgca 



14581 atgttgcttt tcaaactgtc aaacccggta attttaataa agacttttat 

gactttgctg 

14641 tgtctaaagg tttctttaag gaaggaagtt ctgttgaact aaaacacttc 
ttctttgctc 

14 7 01 aggatggcaa cgctgctatc agtgattatg actattatcg ttataatctg 
ccaacaatgt 
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ctegtagteg 


aagttgttga 




gattgttacg 

14321 atggtggctg 


tattaatgee 


aaccaagtaa 


ccgttaacaa 


tctggataaa 


tcagccggtt 










14381 tcecattcaa 


taaatggggt 


aaggctagac 


tttattatga 


ctcaatgagt 


tatgaggatc 

14341 aagatgcscr 


tttcgegtat 


actaagegta 


atgtcatccc 


tactataact 


caaatgaate 










15001 tcaagtatgc 


cattagtgea 


aagaatagag 


ctcgcaccgt 


agctggtgtc 


tctatctgta 










15061 gtactatgac 


aaatagaeag 


tttcatcaga 


aattattgaa 


gtcaatagee 


gecaet.agag 










15121 gage t act g t 


ggtaattgga 


acaagcaagt 


tttacggtgg 


ctggcataat 


aegteaaaaa 










15181 otqttCacag 


tgatgtagaa 




ttatgggttg 


ggattatcca 


aaatgtgaca 










15241 gagccafcgcc 
'' * aca 


taacatgett 


aggataatgg 


cctctcttgt 


tcttgctcgc 


15301 cttgctgtaa 
caagtatr,aa 


cttatcacac 


cgtttctaca 


ggttagctaa 


cgagtgtgcg 


15361 gtgagac.ggt 


catgtgtggc 


ggctcactat 


atgttaaacc 


aggtggaaca 


teateeggtg 










15421 atgetacaac 


tgettatget 


aatagtgtct 


ttaacatttg 


teaagetgtt 


acagecaafcg 










15481 taaatgeact 


tctttcaact 


gatggtaata 


agatagctga 


caagtatgtc 


cgcaafcctac 










15541 aacacaggct. 


ctatgagtgt 


ctctatagaa 


atagggatgt 


tgatcatgaa 


ttcgtggatg 










15601 agttttacgc 


ttacctgcgt 


aaacatttct 


ccatgatgat 


tctttctgat 


gatgecgetg 










15661 tgtgctataa 


cagtaact at 


qcggctcaag 


gtttagtagc 


tagcattaag 


aaccttaagg 










15721 cagtccttta 


ttatcaaaat 


aatgtgttca 


tgtctgaggc 


aaaatgttgg 


actgagacr.g 










15731 accttactaa 


aggacctcac 


gaat.cttgct 


cacagcatac 


aatgctagtt 


aaacaaggag 

15841 atgattacgt 


gtacctgcct 


tacccagatc 


catcaagaat 


attaggegea 


ggccgttttg 










15901 tegatgatat 


tgtcaaaaca 


gatggtacac 


ttatgattga 


aaggttcgtg 


tcactggcta 










15961 ttgatgetta 


cccactcaca 


aaacatccta 


atcaggagta 


tgcCgatgtc 


tttcactcgt 










16021 atttacaata 


cattagaaag 


ttacatgatg 


agcttactgg 


ccacatgttg 


gacatgtatt 










16081 cegtaatget 


aactaatgat. 


aacacctcac 


ggtactggga 


acctgagttt 


tatgaggcta 










16141 t.gtacacacc 


acatacagtc 


ttgcaggctg 


taggtgcttg 


tgtattgtgc 


aat.tcacaga 










16201 ctccacttcg 


t~-tgcgc?c.gcc 


tgtat tageja 


gaccattcct 


atcrt tgcaag 


rgcugcrarg 










16261 accatgtcac 


tCcaacatca 


cacaaactag 


tgctgtctgt 


taatccctat 


g'cttgcaatg 










16321 ccccaggtcg 


tgatgtcact 


gaegtgacac 


aactgtatct 


aggaggtatg 


agctactact 

16331 gcaagtcaca 


taagcctccc 


attagttttc 


cattatgtgc 


taatggtcag 


gtfrefctggfcfe 










16441 tatacaaaaa 


cacatgtgta 


ggcagtgaca 


atgtcactga 


ettcaatgeg 


atagcaacat 










16501 gtgattggac 


taatgctggc 


gattacatac 


ttgccaacac 


ctgtactgag 


agactcaagc 










16561 Etttcgcagc 


agaaacgetc 


aaagccactg 


aggaaacact 


taagctgtca 


tatggtattg 










16621 ccactgcacg 


cgaagtactc 


tctgacagag 


aattgeatet 


ttcatgggag 


gfeggaaaac 










16681 ctagaccacc 


attgaacaga 


aactatgtct 


,. tact ti . a 


ccgtg^aact 


aaaaatagta 
















aaggegact: a 




gttgtgtaca 










15801 gaggtactac 


gacatacaag 


ttgaatgttg 


gtgattactt 


tgegttgaea 


16861 taatgecact 


tagtgeaect 


actctagtgc 


cacaagagca 


ctatgtgaga 
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16921 tgtacccaac actcaacatc tcagatgagt tttctagcaa tgttgcaaat 
t j atgca aaagtactct acactccaag gaccacctgg tactggtaag 

agtcattttg 

17041 ceateggaet tgctctctat tacccatctg ctcgcatagt gtatacggca 
tgctctcatg 

17101 cagctgttga tgccctatgt gaaaaggcac taaaatattt gcccatagat 
aaatgtagta 

17161 gaatcatacc tgcgcgtgcg cgcgtagagt gttttgataa attcaaagtg 

17221 tagaacagta tgttttctgc actgtaaatg cattgccaga aacaactgct 
gacattgtag 

17281 tctttgatga aatctctatg gctactaatt atgacttgag tgttgtcaat 
gctagacttc 

17341 gcgcaaaaca ccacgtctac attggcgatc ctgctcaatt accagccccc 
cgeacaEtgc 

17401 tgactaaagg cacactagaa ccagaatatc ttaattcagt gtgcagacut 
atgsaaacaa 

17461 taggtccaga catgttcctt ggaactcgtc gccgttgtcc tgccgaaatt 
gttgacacfcg 

17521 tgagtgettt agtctatgac aataagctaa aagcacacaa ggataagtca 
gctcaatgct: 

17581 tcaaaatgtt ctacaaaggt gttattacac atr.gatgtttc atctgcaatc 
aacagaccte 

17641 aaataggcgt tgtaagagaa tttcttacac gcaatcctgc ttggagaaaa 
gctgttttta 

17701 tctcacctta taattcacag aacgctgtag cttcaaaaat cttaggattg 
Gcfcabgcaga 

17761 ctgttgattc atcacagggt tctgaatatg actatgtcat attcacacaa 
actaccgaaa 

17821 cagcacactc ttgtaatgtc aaccgcttca atgtggctat cacaagggca 
aaaattggca 

17381 ttttg'tgcat aatgtctgat agagatcttc atgacaaact gcaatctaca 
agtctagaaa 

17S41 Caceaegtcg caaugtggct acattacaag cagaaaatgt aactggacet 
ttta$gf$c:t : : 

igat cattactggt cttcatccta cacaggcacc tacacacctc 

agcgttgata 

18061 taaagttcaa gactgaagga ttatgtgttg acataccagg cataccaaag 
gacatgacct 

18121 accgtagact catctctatg atgggtttca aaatgaatta ccaagtcaat 
ggttacccta 

13181 atatgtttat cacccgcgaa gaagctattc gtcacgttcg tgcgtggatt 
ggcttcgatg 

18241 tagagggctg tcatgcaact agagatgctg tgggtactaa cctacctctc 
cagctaggat 

18301 tttctacagg tgttaactta gtagctgtac cgactggtta tgttgacact 
gaaaataaca 

18361 cagaattcac cagagttaat gcaaaacctc caccaggtga ccagtttaaa 
catcttatac 

18421 cactcatgta taaaggcttg ccctggaatg tagtgcgtat taagatagta 
caaatgctca 

18481 gtgatacact gaaaggattg tcagacagag tcgtgUtcgt cctttgggcg 
catggctttg 

18541 agcttacatc aatgaagtac tttgtcaaga ttggacctga aagaacgtgt 
tgtctgtgtg 

18601 acaaacgtgc aacttgcttt tctacttcat cagatactta tgcctgctgg 
aatcatCctg 

18661 tgggttttga ctatgtctat aacccattta tgattgatgt tcagcagtgg 
ggctttacgg 

1872 1 gtaacctcca gagtaaccat gaccaacaCt gccaggtaca tggaaacgca 
catgtggcta 

18731 gttgcgacgc tatcatgact agatgttcag cagtccacga gcgctttgtc 
aagcgcgttg 

18841 attggtctgt tgaataecct atcacaggag atgaactgag ggttaattcc 
gcttgcagaa 

18901 aagtacaaca catggttgtg aagtctgcat tgctcgctga taagttucca 

gttcttcaCg 

18961 aeafcggaaa tccaaaggct atcaagtgtg tgcctcaggc tgaagtagaa 
tggaagttct 

19021 acgatgctca gccatgtagt gacaaagctt acaaaataga ggaactcttc 
tattcttatg 
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19081 ctacacatca 
19141 gttacccagc 

19201 taccaggctg 
actccagctt 

19261 tegataaaag 
tctgatagtc 

19321 cttgtgagtc 
ctcaaatctg 

19381 ctacgtgtat 
gcaaatgagt 

19441 accgacagta 

19501 acaaacaatt 

19561 atgtggctta 
gaagcacctg 

19621 tttccatcat 
gagatctttg 

19681 aaaataagac 
cgtaaeatta 

19741 aaccagtgcc 
gctaatactg 

19801 taatctggga 
gtctgcacaa 

19861 tgactgacat 
gtcttgtttg 

19921 atggtagagt 
gttttaataa 

19981 cagaaggttc 
agcgtcaatg 

20041 gagtcacatt 
aaagtagacg 

20101 gcattattca 
gaggatttta 

20161 agcccagatc 
ttcatacagc 

2 0221 gatataagct 
agtcatggac 

20281 aacttggcgg 

20341 aattagagga 
acagatgcgc 

20401 aaacaggttc 
gactttgtcg 

2 0461 agataataaa 

20521 atgctgaaat 

20581 aactacaagc 
tacaagatgc 

2 0641 aaagaatgct 
gttataccaa 

20701 aaggaataat 
aatacactta 

20761 ctttagctgt 
gataaaggag 

20821 ttgcaccagg 

20881 cagatcttaa 
tgtgcaacag 

20941 tacatacggc 
aggaccaaac 

21001 atgtgacaaa 
ggatttataa 

21061 agcaaaaact 
tcttggaatg 

21121 ctgaccttta 
aeaaatgcaa 

21181 atgcatcatc 
ccgaaggaac 



TOR2, compile 



caatgcaatt 


gtgfcgfcaggfc 


tgatggtggt 




tgcatttact 


aatttaaago 


tcatggcaaa 


caagtagf.gt 


tacacgatgc 


aatttaggtg 


cttggatgca 


tataatatga 


tgatacttat 


aacctgtgga 


taatgttgtt 


aataaaggac 


taataatgct 


gtttacacaa 




gttaatgttg 


agagattaag 


atactcaata 


ctacaaaaga 


gaagccccag 


tgccaagaaa 


cctactgaga 


ggaaggacag 


gtagaccttt 


agtcaaaggt 




aattggagaa 


tcagtaaaaa 


acagttgcct 


gaaacc ac 


acaaatggaa 


actgactttc 


cgagggctat 


gccttcgaac 


tcttcattta 


atgataggct 


ttttatccct 


atggacagca 


atcaaaacgt 


gtgtgttctg 


gtcacaagat 


ttgtcagcga 


ttcattcatg 


ctttggtgta 


aagtcaagcg 


tggcaaccag 


tcttgaaaag 


tgtgaccttc 


gatgaatgtc 


gcaaagtata 


ac^ctacaac 


atgagagtta 


tacagctgtg 


ctcagacaat 


tgacttcgtc 


tccgacgcag 


taataaatgg 


gaccttatta 


agagaatgac 


tctaaagaag 


agccctgggt 


ggttctatag 


caagcttatg 


ggccatttct 


atcggaagca 


tttttaattg 



genome 




t q a a t a a g c a t g c a 1 1 c c a c 
aafcfegeetfcfc cttttacfcat 
. rnttga ttatgttcca 
gtgctgtttg cagacaccat 
tgatttctgc tggatttagc 
atacatttac caggttacag 
actttgatgg acacgccggc 
aggtagatgg cattgatgtg 
catttgagct ttgggctaag 
atttgggtgt Cgatatcgct 
cacatgtatc cacaataggt 
gtgcttgttc ttcacttact 
ttagaaacgc ccgCaatggt 
caaagggacc agcacaagct 
cacagtttaa ctactttaag 
ttactcagag cagagactta 
tcgagctcgc tatggatgaa 
acatcgttta tggagatttc 
tagccaagcg ctcacaagac 
cagtgaaaaa tcacttcata 
tgattgatct tctacttgac 
tttcaaaagt ggtcaaggtt 
aggatggaca tgttgaaacc 
gtgttgcgac gcctaacttg 
agaattatgg tgaaaatgct 
ctcaactgtg tcaatactta 
ttcactttgg tgctggctct 
ggttgccaac tggcacacta 
attccacttt aattggagac 
ttagcgatat gtatgaccct 
ggtttttcac ttatctgtgt 
ctgtaaagat aacagagcat 
catggtggac agcttttgtt 
gggctaacta tcttggcaag 
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21241 aaattgatgq 


ctataccatg 


catgctaact 


aatcctatcc 






21301 agttgtcttc 


ctaetcactc 


tttgacatga 


agaggaactg 






21361 ctgtaatgtc 


tcttaaggag 


aatcaaatca 


ctggaaaaag 






21421 gtaggcttat 


cattagagaa 


aacaacagag 


cttgttaaca 






21481 actaaacgaa 


catgtttatt 


ttcttattat 


agtgaccttg 






21541 accggtgcac 


cacttctgat 


gatgttcaag 


acttcatcta 






21601 tgaggggggt 


ttactatcct 


gatgaaattt 


ttaactcagg 






21661 attcatttct 


tccattttat 


tctaatgtta 


catacgtttg 






21721 gcaaccctgt 


catacctttt 


aaggatggta 


aaatcaaatg 






21781 ttgtccgtgg 


ttgggttttt 


ggttctacca 


gtgattatta 






21841 ttaacaattc 


cactaatgtt 


gttatacgag 


gacaaccctt 






21901 tctttgctgt 


ttctaaaccc 


atgggcacac 


gataatgcat 






21961 ttaattgcac 


tttcgagtac 


atatetgatg 


gaaaagtcag 






22021 gtaattttaa 


acacttacga 


gagtttgtgt 


ctctatgttt 






22081 aCaagggcta 


tcaacctata 


gatgtagttc 


aacactttga 






22141 aacctatttt 


taagttgcct 


cttggtatta 


attcttacag 






22201 ccttttcacc 


tgctcaagac 


atttggggca 


gttggccatt 






22261 taaagccaac 


tacatttatg 


ctcaagtatg 


gatgctgttg 






22321 attgttctca 


aaatccactt 


gctgaacuca 


gagattgaca 






22381 aaggaattta 


ccagacctct 


aatttcaggg 


gcgagattcc 






22441 ctaatattac 


aaacttgtgt 


ccttttggag 


ttcccttctg 






22501 tctatgcatg 


ggagagaaaa 


aaaatttcta 


gtgctctaca 






22561 actcaacatt 


tttttcaacc 


tttaagtgct 


ttgaatgatc 






22621 tttgcttctc 


caatgtctat 


gcagattctt 


gtaagacaaa 






22681 tagcgccagg 


acaaactggt 


gttattgctg 


gatgatttca 






22741 tgggttgtgt 


ccctgcttgg 


aatactagga 


ggtaattata 






22801 attataaata 


taggtatctt 


agacatggca 


gacatatcta 






22861 atgtgccttt 


ctcccctgat 
cccc ga 


acrcaaacctt 


tgttattggc 






22921 cattaaatga 


ttatggtttt 


tacaccacta 


tacagagttg 






22981 tagtactttc 
aaattatcca 


tttcgaactt. 


ttaaatgcac 


2 3 041 ctgaccttat 


taagaaccag 


tgtgtcaatt 


ggtactggtg 






23101 tgtcaactcc 


ttcctcaaag 


agatttcaac 


gatgtttctg 












tcaccttgcg 






23221 cttttggggg 


tgtaagtgta 


attacacctg 


gttgctgttc 






23281 tatatcaaga 


tgttaactgc 


actgatgttt 


23341 cagcttggcg 


catatattct 


actggaaaca 



ggctgtctta 
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atgatatgat ctattctctt 
ttgtggtttc aagtgatatt 
ttcttactct cactagtggt 
ctcctaatta cactcaacat 
ttagatcaga cactctttat 
cagggdttca tactattaat 
tttattttgc tgccacagag 
tgaacaacaa gtcacagtcg 
catgtaactt tgaattgcgt 
agacacatac tatgatattc 
ccttttcgct tgatgttcca 
ttaaaaacaa agatgggctt 
gtgatctacc ttctggtttt 
acattacaaa ttttagagcc 
cgtcagctgc agcctatttt 
atgaaaaugg tacaatcaca 
aatgctctgt taagagcttt 
ttgttccctc aggagatgtt 
aggttttcaa tgctactaaa 
attgtgttgc tgattactct 
atggcgtttc tgccactaag 
ttgtagtcaa gggagatgat 
attataatta taaattgcca 
acattgatgc tacttcaact 
agctcaggcc ctttgagaga 
g caccccacc tgctcttaat 
zt g hi - jg :taccaacct 
cggccacggt ctgtggacca 
ttaattttaa tggactcact 
catttcaaca atttggccgt 
catctgaaat attagacatt 
gaacaaatgc ttcatctgaa 
ctacagcaat tcatgcagat 
atgcattcca gactcaagca 



Nucleotide - SARS coro 

23401 taggagctga 
gctggcattt 

23461 gtgctagtta 
attgtggctt 

23521 acactatgtc 
attgctatac 

23531 ctactaactt 
gctaaaacct 

23641 ccgtagattg 
ttgcttctcc 

23701 aatatggtag 
gctgaacagg 

23761 atcgcaacac 
ccaactttga 

23821 aatattttgg 

23881 ggtcttttat 
ggcttcatga 

23941 agcaatatgg 
gcgcagaagt 

24001 tcaatggact 
gcctacactg 

24061 ctgctctagt 
gctgctcttc 

24121 aaataccttt 
acccaaaatg 

24181 ttctctatga 
agtcaaattc 

24241 aagaatcact 
gttaaccaga 

24301 atgctcaagc 

24361 gtgcgctaaa 
caaattgaca 

24421 ggttaattac 
ctaatcaggg 

24481 ctgctgaaat 
tgtgttcttg 

24541 gacaatcaaa 
ttcccacaag 

24601 cagccccgca 
gagaggaact 

24661 tcaccacagc 
gaaggtgttt 

24721 ttgtgtttaa 
ccacaaataa 

24781 ttactacaga 

24841 acacagttta 
ctggacaagt 

24901 acttcaaaaa 
attaacgctt 

24961 ctgtcgtcaa 
aatttaaatg 

25021 aatcactcat 
tggccttggt 

25081 atgtttggct 
atcttgcttt 

2 5141 gttgcatgac 
tcttgctgca 

25201 agtttgatga 
tacacataaa 

25261 cgaacttatg 
cacagccagt 

2 5321 aaaaattgac 

25381 agcctcactc 

agggcttcca 

25501 gctcatttgc 
tgcttgtcgc 



navirus TOR2, complete 

gcatgtcgac acttcttatg 

tctaggtgct gatagttcaa 
ttcaattagc attactacag 
taatatgtac atctgcggag 
cttttgcaca caactaaatc 
acgtgaagtg ctcgct caag 
tggttttaat ctttcacaaa 
tgaggacttg ctcfcttaata 
cgaatgccta ggtgatatta 
tacagtgttg ccaccCctgc 
tagtggtact gccactgccg 
tgctatgcaa atggcataca 
gaaccaaaaa caaatcgcca 
tacaacaaca tcaactgcat 

tgatatcctt tcgcgacttg 
aggcagactt caaagccttc 
cagggcttct gctaatcttg 
aagagttgac ttttgtggaa 
tggtgttgtc ttcctacacg 
gccagcaatt tgtcatgaag 
tggcacttct tggtttatta 
caatacattt gtctcaggaa 
fcgatcctctg caacctgagc 
tcatacatca ccagatgttg 
cattcaaaaa gaaafcgacc 
tgaccttcaa gaattgggaa 
cggcttcatt gctggactaa 
tagctgttgc agttgcctca 
ggatgactct gagccagttc 
gatttgttta tgagattttt 
aatgcttctc ctgcaagtac 
cctttcggat ggcttgtCat 
ataattgcgc tcaataaaag 
aatttactgc tgctatttgt 



genome 




agtgcgacac 


tcctattgga 


gtagtactag 


ccaaaaatct 






aagtaatgcc 


cgtttctatg 


attctactga 


atgtgctaat 


gtgcactctc 


aggtattgct 


ticaaacaaat 


gtacaaaacc 


tattacctga 


ccctctaaag 


aggtgacact 


cgctgatgct 


atgctagaga 


tctcatttgt 


tcactgatga 


tatgattgct 


gatggacatt 


tggtgctggc 


ggttcaatgg 


cattggagtt 




caaggcgatt 


tgggcaagct 


gcaagacgtt 


aau. ag._ c 




ataaagtcga 


ggcggaggta 


aaacctatgt 




ctgctactaa 


aatgtctgag 


agggctacca 


ccttatgtcc 


tcacgtatgt 


gccatcccag 


gcaaagcata 


cttccctcgt 


cacagaggaa 


cttcttttct 


attgtgatgt 


cgttattggc 


ttgactcatt 


caaagaagag 


atcttggcga 


catttcaggc 


gcctcaatga 


ggtcgctaaa 


aatatgagca 


atatattaaa 


ttgccatcgt 


catggttaca 


agggtgcatg 


ctcttgtggt 


tcaagggtgt 


caaattacat 


tactcttaga 


tcaattactg 


tgttcatgct 


acagcaacga 


tggcgttgca 


tttcttgctg 


atggcagcta 




taccatctat 


tcacatctct 



Page 19 of 21 



http://www.ncbi.nlm.nih.gov/nuccore/30248028 



Nucleotide - SARS coronavirus TOR2, complete genome Page 20 of 2 i 



tacct ctatgccttg atatattttc 

25621 caacgcatgt agaattatta tgagatgttg gctttgttgg aagtgcaaat 

25681 attactttat gatgccaact actttgtttg ctggcacaca cataactatg 

2 5741 accatataac agtgtcacag atacaactgt cgttactgaa ggtgacggca 
tttcaacacc 

25801 aaaactcaaa gaagactacc aaattggtgg ttattctgag gataggcact 
caggCgttaa 

25861 agactatgtc gttgtacatg gctatttcac cgaagtttac taccagcctg 



i gacactggta ttgaaaatgc tacattcctc atctttaaca 

25981 agacccaccg aatgtgcaaa tacacacaat cgacggccct tcaggagttg 

26041 aatggatcca atttatgatg agccgacgac gactactag-c gtgcctttgt 

26101 aagtgagtac gaacttatgt actcattcgt ttcggaagaa acaggtacgt 
taatagttaa 

26161 tagcgtactt ctttttcttg ctttcgtggt attcttgcta gtcacactag 
ccatccttac 

26221 tgcgcttcga ttgtgtgcgt accgctgcaa tattgttaac gtgagtttag 



26281 ggtttacgtc tactcgcgtg ttaaaaatct gaactcttct gaaggagttc 
ctgatcttct 

26341 ggtctaaacg aactaactat tattattatt ccgtttggaa ctttaacatt 
gcttatcatg 

26401 gcagacaacg gtactattac cgttgaggag cttaaacaac tcctggaaca 

264'61 gtaataggtt tcctattcct agcctggatt atgttactac aatttgccta 
ttctaatcgg 

26521 aacaggtttt. tgtacat.aat aaagcttgtt ttectctggc cctt.gtggcc 

26581 gcttgttttg tgcttgctgc tgtctacaga attaattggg tgactggcgg 
gattgcgatt 

26641 gcaatggctt gtattgtagg cttgatgtgg cttagctact tcgttgcttc 
cttcaggctg 

26701 tttgctcgta cccgctcaat gcggtcattc aacccagaaa caaacattct 
tctcaatgtg 

26761 cctctccggg ggacaattgt gaccagaccg cteatggaaa gtgaacttgt 
cattggtgct 

26821 gtgatcattc gtggtcactt gcgaatggcc ggacactccc tagggcgctg 
tgacattaag 

26881 gacctgccaa aagagatcac tgtggctaca tcacgaacgc tttcttatta 
caaattagga 

26941 gcgtcgcagc gtgtaggcac tgattcaggt tttgctgcat acaaccgcca 
ccgtattgga 

27001 aactataaat caaatacaga ccacgccggt agcaacgaca atattgcttt 
gctagtacag 

27061 taagtgacaa cagatgtttc atcttgttga cttccaggtt acaatagcag 
agatattgat 

27121 tatcattatg aggactttca ggattgctat ttggaatctt gacgttataa 
taagttcaat 

27181 agtgagacaa ttatttaagc ctctaactaa gaagaattat tcggagttag 

atgatgaaga 

27241 acctatggag ttagattatc cataaaacga acatgaaaat tattctcttc 
ctgacattga 

27301 ttgtatttac atcttgcgag ctatatcact atcaggagtg tgttagaggt 
acgactgtac 

27361 tactaaaaga accttgccca tcaggaacat acgagggcaa Ctcaccattt 
caccctcttg 

27421 ctgacaataa atttgcacta acttgcacta gcacacacct tgcttttgcc 
tgtgctgacg 

27481 gtacxcgaca tacccatcag etgcgtgcaa gatcagtttc accaaaactt. 

27541 aagaggaggt tcaacaagag ctctactcgc cactttttct cattgttgct 
gctctagtat 

27601 ttttaatact ttgcttcacc attaagagaa agacagaatg aatgagctca 

ctttaattga 

27661 cttctatttg tgcttttcag cctttctgct attccttgtt ttaataatgc 
ttattatatt 
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27721 ttggttttca ctcgaaatcc 
aaacgaacat 

27781 gaaacttctc attgttttga 
ctgtagtaca 

27841 gcgctgtgca tctaataaac 

actta r.agcactgcc 

27961 ggcacactat ggttcaaaca 
gatccagctg 

28021 gtggtgcgct tatagctagg 

28081 gagacgtact tgttgttcta 
tggaccccaa 

28141 tcaaaccaac gtagtgcccc 
aactgacaat 

28201 aaccagaatg gagga , 
aggtttaccc 

28261 aataatactg cgtcttggtt 
acttagattc 

28321 cctcgaggcc agggcgttcc 
aattggctac 

28381 taccgaagag ctacccgacg 
gctcagcccc 

28441 agatggtact tctattacct 
cggcgctaac 

2 8 501 aaagaaggca tcgtatgggt 
agaccacatt 

28561 ggcacccgca atcctaataa 
aggaacaaca 

28621 ttgccaaaag gcttetacgc 
ttctcgctcc 

28681 tcatcacgta gtcgcggcaa 
gggaaattct 

28741 cctgctcgaa tggctagcgg 
gctagacaga 

28801 ttgaaccagc ttgagagcaa 
ccaaactgtc 

28861 ac'caagaaat ctgctgctga 
tgccacaaaa 

aggaaatttc 

28981 ggggaccaag acctaatcag 

29041 tttgctccaa gtgcctctgc 
agtcacacct 

29101 tcgggaacat ggctgactta 
tccacaattc 

29161 aaagacaacg tcatactgct 
cccaccaaca 

29221 gagcccaaaa aggacaaaaa 
gcagagacaa 

2 9281 aagaagcagc ccactgtgac 
ctccagacaa 

29341 cttcaaaatt ccatgagtgg 
actcatgatg 

2 9401 accacacaag gcagatgggc 

. 29461 tactcttgtg cagaatgaat 
gttaacttta 

29521 atctcacata gcaatcttta 
agagccacca 

29581 cattttcatc gaggccacgc 
ctagggagag 

2 9641 ctgcctatat ggaagagccc 
tccccatgtg 

29701 attttaataq cttcttagaa 

// 
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aggatctaga agaaccttgt accaaagtct 
cttgtatttc tctatgcagt tgcatatgca 
ctcatgtgct tgaagatcct tgtaaggtac 
tggctttgtg ctctaggaaa ggttttacct 
tgcacaccta atgttactat caactgtcaa 
tgttggtacc ttcatgaagg tcaccaaact 
aataaacgaa caaattaaaa tgtetgataa 
ccgcattaca tttggtggac ccacagattc 
tggggcaagg ccaaaacagc gccgacccca 
cacagctctc actcagcatg gcaaggagga 
aatcaacacc aatagtggte cagatgacca 
agctcgtggt ggtgacggca aaatgaaaga 
aggaactggc ccagaagctt cacttcccta 
tgcaactgag ggagcctuga atacacccaa 
caatgctgcc accgtgccac aacttcccca 
agagggaagc agaggcggca gtcaagcctc 
ttcaagaaat tcaactcctg gcagcagtag 
aggtggtgaa actgccctcg cgctattgct 
agtttctggt aaaggccaac aacaacaagg 
ggcatctaaa aagcctcgcc aaaaacgtac 
atttgggaga cgtggtccag aacaaaccca 
acaaggaact gattacaaac attggccgca 
attctttgga atgtcacgca fcggcatgga 
tcatggagcc attaaattgg augacaaaga 
gaacaagcac attgacgcat acaaaacatt 
gaaaaagact gatgaagctc agcccttgcc 
tcttcttcct gcggctgaca tggatgat.tt 
agcttctgct gattcaactc aggcataaac 
tatgtaaacg ttttcgcaat tccgtttacg 
tctcgtaact aaacagcaca agtaggttta 
atcaatgtgt aacattaggg aggacttgaa 
ggagtacgat cgagggtaca. gtgaataatg 
caatgtgtaa aattaatttt agtagtgcta 
gaacgacaaa aaaaaaaaaa aaaaaaaaaa a 
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irus B0333 2S,£: 0.0 

irus civet 019] 2;', 0 0.0 

spike glycoprotein [SARS eoronavirua B012; 2660 0.0 

spike glycoprotein I SARS coronavirus C025J : ; fe' 0.0 

spike glycoprotein [ SARS coronavirus A001] 2660 0.0 

spike glycoprotein [SARS coronavirus civetOlO] . . . :;;>*,:. 0.0 

spike glycoprotein [SARS coronavirus C028J 2$5>:9 0.0 

spike glycoprotein i SARS cororiav:! r us G013j 2;Y.:;o 0 . 0 

spike glycoprotein [SARS coronavirus civetOO'V] ? 0.0 

spike glycoprotein [SARS coronavirus A022J >gb... 0 . 0 

spike glycoprotein [recombinant coronavirus] 7 2 • - : 0.0 

spike protein [Bat SARS CoV Rml/2004] >gb. . . ?J;Vk 0.0 

_ " ■ RecName: Full=Spike glycoprotein; Sho . . . :■■'<.' 0.0 

spike protein [Bat SARS CoV Rp3/2004] >sp. . . 0.0 H 

spike glycoprotein [Bat coronavirus HKU3] . . . 2040 0.0 □ 

spike glycoprotein [bat SARS coronavirus HKO.3-2] . 1 0.0 

spike protein [Bat coronavirus (BtCoV/273/2005) ] 20'.- 0.0 

;>..; spike protein [Bat SARS CoV Bf 1/2004 J >gb. . . /G.,\ 0.0 

spike glycoprotein [ SARS coronavirus GZ0403] 0.0 

S3 protein [SARS coronavirus GD322] 14&0 0.0 

S2 protein [SARS coronavirus GD322] 1103 0.0 

truncated SARS coronavirus spike glycoprotein . . . 101.3 0 . 0 

spike glycoprotein [ SARS coronavirus TW-HP.lj >... 0.0 

spike glycoprotein [SARS coronavirus TW-YM2] 0.0 

spike glycoprotein [SARS coronavirus TW-GD3 3 >. . . :J~J. 0 . 0 

spike glycoprotein [SARS coronavirus TW-GD4] ••"0 0.0 

spike glycoprotein [SARS coronavirus CUHK-L2] ,;.:G : 0.0 

Spike glycoprotein [Bat coronavirus HKU9-2] JQ? 0.0 

K i: spike glycoprotein [Bat coronavirus HKU9] . . . le-178 Esl 

Spike glycoprotein [Bat coronavirus HKU9-3] .JZ2 6e-176 

Spike glycoprotein [Bat coronavirus HKU9-4J JQ§ 6e-171 

2..J.J. spike glycoprotein [Bat coronavirus HKU5] . . . J>31 2e-166 Q 

spike protein [Bat coronavirus {BtCoV/133/20 . . . ...603 3e-l64 B 

spike glycoprotein [SARS coronavirus cw049] „„5|0 3e-163 

Spike glycoprotein [Bat coronavirus HKU5-2] .670 7e-163 

Spike glycoprotein [Bat coronavirus HKU5-3J >g. . . f?J 2e-162 

spike [Bat coronavirus (BtCoV/A434/2005) 3 .3.11 2e-161 

Spike glycoprotein [Bat coronavirus HKU4 - 3 j ...fi.L'i 9e-161 

Spike glycoprotein [Bat coronavirus HKU4-2] ...ill le-160 

3 : j spike glycoprotein [Bat coronavirus HKU4] . . . ...572 le-160 
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CDC Lab Sequences Genome of New Coronavirus 



CDC News 
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The Centers for Disease Control and Prevention (CDC) announced today 
that it has sequenced the genome for the coronavirus believed to be 
responsible for the global epidemic of severe acute respiratory syndrome 
or SARS. The CDC sequence is nearly identical to that determined by a 
Canadian laboratory late last week. The significant difference is that the 
CDC-deterrnlned sequence has 15 additional nucleotides, which provides 
the important beginning of the sequence, CDC scientists said. 



Related Links 



The results came just 12 days after a team of 10 scientists, supported by 
numerous technicians, began working around the clock to grow cells taken 
from a throat culture taken from one of the SARS patients in Vero cells 
(African green monkey kidney cells) in order to reproduce the ribonucleic 
acid (RNA) of the disease-causing coronavirus. The new sequence has 
29,727 nucleotides, which places It well within the typical RNA boundaries 
for coronaviruses. Members of this viral family tend to have between 
29,000 and 31,000 nucleotides. 



Identifying the genetic sequence of a new virus is important to efforts to 
treat or prevent it, said Dr. Julie Gerberding, CDC director. "Research 
laboratories can use this information to begin to target antiviral drugs, to 
form the basis for developing vaccines, and to develop diagnostic tests 
that can lead to early detection." 
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In sequencing the genome, CDC scientists worked closely with 
coronavirus experts at academic institutions across the United States 
"This is an active, working community of scientific experts who have been 
contributing their knowledge and expertise throughout this investigation," 
said William Bellini, Ph.D., SARS laboratory team coordinator. 
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The nearly identical findings in the US and Canada are important because 
they were derived from different individuals who were infected in different 
countries. This suggests that the virus probably originated from a common 
source. 

The CDC's analysis of the virus is far from finished, officials emphasized 
Because coronaviruses have the ability to mutate rapidly, scientists will 
compare the sequences from viruses isolated in cell culture to those 
obtained from diseased tissues taken from SARS patients. "This is 
essentially a draft. Now we need to see if what we have identified in the 
laboratory matches what's causing disease in patients," Bellini said. 



But the groundbreaking work of isolating the genomic sequence speeds 
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the task of comparison. 



### 



CDC protects people's health and safety by preventing and controlling 
diseases and injuries; enhances health decisions by providing credible 
information on critical health issues; and promotes healthy living through 
strong partnerships with local, national and international organizations. 
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